21 research outputs found

    Entwicklung einer computergestützten Methode zum reaktionsbasierten De-Novo-Design wirkstoffartiger Verbindungen

    Get PDF
    A new method for computer-based de novo design of drug candidate structures is proposed. DOGS (Design of Genuine Structures) features a ligand-based strategy to suggest new molecular structures. The quality of designed compounds is assessed by a graph kernel method measuring the distance of designed molecules to a known reference ligand. Two graph representations of molecules (molecular graph and reduced graph) are implemented to feature different levels of abstraction from the molecular structure. A fully deterministic construction procedure explicitly designed to facilitate synthesizability of proposed structures is realized: DOGS uses readily available synthesis building blocks and established reaction schemes to assemble new molecules. This approach enables the software to propose not only the final compounds, but also to give suggestions for synthesis routes to generate them at the bench. The set of synthesis schemes comprises about 83 chemical reactions. Special focus was put on ring closure reactions forming drug-like substructures. The library of building blocks consists of about 25,000 readily available synthesis building blocks. DOGS builds up new structures in a stepwise process. Each virtual synthesis step adds a fragment to the growing molecule until a stop criterion (upper threshold for molecular mass or number of synthesis steps) is fulfilled. In a theoretical evaluation, a set of ~1,800 molecules proposed by DOGS is analyzed for critical properties of de novo designed compounds. The software is able to suggest drug-like molecules (79% violate less than two of Lipinski’s ‘rule of five’). In addition, a trained classifier for drug-likeness assigns a score >0.8 to 51% of the designed molecules (with 1.0 being the top score). In addition, most of the DOGS molecules are deemed to be synthesizable by a retro-synthesis descriptor (77% of molecules score in the top 10% of the decriptor’s value range). Calculated logP(o/w) values of constructed molecules resemble a unimodal distribution centred close to the mean of logP(o/w) values calculated for the reference compounds. A structural analysis of selected designs reveals that DOGS is capable of constructing molecules reflecting the overall topological arrangement of pharmacophoric features found in the reference ligands. At the same time, the DOGS designs represent innovative compounds being structurally distinct from the references. Synthesis routes for these examples are short and seem feasible in most cases. Some reaction steps might need modification by using protecting groups to avoid unwanted side reactions. Plausible bioisosters for known privileged fragments addressing the S1 pocket of trypsin were proposed by DOGS in a case study. Three of them can be found in known trypsin inhibitors as S1-adressing side chains. The software was also tested in two prospective case studies to design bioactive compounds. DOGS was applied to design ligands for human gamma-secretase and human histamine receptor subtype 4 (hH4R). Two selected designs for gamma-secretase were readily synthesizable as suggested by the software in one-step reactions. Both compounds represent inverse modulators of the target molecule. In a second case study, a ligand candidate selected for hH4R was synthesized exactly following the three-step synthesis plan suggested by DOGS. This compound showed low activity on the target structure. The concept of DOGS is able to deliver synthesizable and bioactive compounds. Suggested synthesis plans of selected compounds were readily pursuable. DOGS can therefore serve as a valuable idea generator for the design of new pharmacological active compounds.Im Rahmen der vorliegenden Arbeit wird eine neue Methode zum computergestützten de novo Design von wirkstoffartigen Molekülen vorgestellt. Ziel ist es, automatisiert und zielgerichtet neuartige Moleküle mit biologischer Aktivität zu entwerfen. Das entwickelte Programm DOGS (Design of Genuine Structures) schlägt zusätzlich zu den chemischen Verbindungen mögliche Strategien zu deren Synthese vor. Ein vollständig deterministischer Konstruktionsalgorithmus verwendet verfügbare Synthesebausteine und etablierte chemische Reaktionen zum Aufbau der neuen Moleküle. Die Bibliothek der Synthesebausteine umfasst etwa 25.000 Moleküle mit einer molekularen Masse zwischen 30 und 300 Da. Die Sammlung der Reaktionen zur Verknüpfung der Bausteine besteht aus 83 literaturbeschriebenen chemischen Reaktionen. Ein Großteil stellt Syntheseschritte zur Generierung neuer Ringsysteme dar. DOGS baut neue Moleküle schrittweise auf: In jedem virtuellen Syntheseschritt wird ein neues Fragment an das wachsende Molekül angefügt, bis eines der Stoppkriterien (Überschreitung einer maximalen molekulare Masse oder Anzahl Syntheseschritte) erfüllt ist. Zur Bewertung der Qualität der Zischen- und Endprodukte wird eine ligandenbasierte Strategie verwendet. Die entstehenden Moleküle werden mit einem bekannten Referenzliganden verglichen, welcher die gewünschte biologische Aktivität aufweist. Das Verfahren zielt dabei auf die Maximierung der Ähnlichkeit der neu konstruierten Moleküle zur Referenz ab. Eine Graphkernmethode berechnet die Ähnlichkeit zum Referenzliganden anhand des Vergleichs ihrer zweidimensionalen molekularen Struktur. In einer theoretischen Auswertung des Programms werden ca. 1.800 generierte potentielle Trypsin-Inhibitoren hinsichtlich solcher Eigenschaften analysiert, welche für neu entworfene Verbindungen kritisch sind: DOGS ist in der Lage wirkstoffartige Moleküle zu entwerfen (79% verletzen weniger als zwei von Lipinskis 'rule of five' Kriterien zur Abschätzung der oralen Bioverfügbarkeit). Zusätzlich wurde die Wirkstoffartigkeit der DOGS-Moleküle durch einen trainierten Klassifizieralgorithmus bewertet. Hierbei erhielten 51% der Verbindungen einen Wert in den oberen 20% des Wertebereichs des Klassifizierers. Weiterhin wird die synthetische Zugänglichkeit für den Großteil der computergenerierten Moleküle als hoch eingeschätzt (77% erhalten einen Wert in den oberen 10% des Wertebereichs eines Deskriptors zur Abschätzung der Synthetisierbarkeit). Die berechneten logP(o/w) Werte der konstruierten Moleküle entsprechen in ihrer Verteilung denen der Referenzliganden. Die Untersuchung der vorgeschlagenen Trypsin-Inhibitoren auf Bioisostere zur Adressierung der S1-Bindetasche zeigt, dass hierfür plausible Vorschläge von DOGS generiert werden. Der Großteil ist potentiell in der Lage eine kritische ladungsvermittelte Interaktion mit dem Protein in der S1-Bindetasche einzugehen. Unter den Vorschlägen befinden sich unter anderem auch drei Seitenketten, für die Interaktionen mit der S1-Bindetasche von Trypsin experimentell bestätigt sind. Eine Analyse ausgewählter Beispiele aus verschiedenen Läufen zum Ligandenentwurf für unterschiedliche biologische Zielmoleküle zeigt, dass das Programm in der Lage ist, die generelle topologische Anordnung potentieller Interaktionspunkte der Referenzliganden in den neu erzeugten Molekülen beizubehalten. Gleichzeitig sind diese Moleküle strukturell verschieden im Vergleich zu den Referenzliganden. Die generierten Synthesewege sind kurz und erscheinen in den meisten Fällen plausibel. Für einige der Syntheseschritte wird bei der praktischen Umsetzung der ergänzende Einsatz von Schutzgruppen notwendig sein, um unerwünschte Nebenreaktionen zu vermeiden. Die Software wurde zusätzlich zu den theoretischen Analysen in prospektiven Studien zum Ligandenentwurf praktisch evaluiert. Hierzu wurde DOGS zur Generierung von Liganden des humanen Histaminrezeptors 4 (hH4R) sowie der humanen gamma-Sekretase eingesetzt. Für hH4R wurde einer der entworfenen potentiellen Liganden synthetisiert, wobei der vorgeschlagene Syntheseweg exakt nachvollzogen werden konnte. Der Ligand weist eine geringfügige Affinität zum Histaminrezeptor auf. Für die gamma-Sekretase wurden zwei der entworfenen Moleküle zur Synthese und Testung ausgewählt. In beiden Fällen konnte auch hier die von DOGS vorgeschlagene Synthesestrategie nachvollzogen werden. Anschließende in vitro Analysen wiesen beide Verbindungen als inverse Modulatoren der gamma-Sekretase aus. Das Konstruktionskonzept von DOGS ist in der Lage, bioaktive Substanzen vorzuschlagen. Diese sind synthetisch zugänglich und können nach der vorgeschlagenen Strategie synthetisiert werden. Somit kann das Programm als Ideengenerator für den Entwurf neuer bioaktiver Moleküle dienen

    DOGS: Reaction-Driven de novo Design of Bioactive Compounds

    Get PDF
    We present a computational method for the reaction-based de novo design of drug-like molecules. The software DOGS (Design of Genuine Structures) features a ligand-based strategy for automated ‘in silico’ assembly of potentially novel bioactive compounds. The quality of the designed compounds is assessed by a graph kernel method measuring their similarity to known bioactive reference ligands in terms of structural and pharmacophoric features. We implemented a deterministic compound construction procedure that explicitly considers compound synthesizability, based on a compilation of 25'144 readily available synthetic building blocks and 58 established reaction principles. This enables the software to suggest a synthesis route for each designed compound. Two prospective case studies are presented together with details on the algorithm and its implementation. De novo designed ligand candidates for the human histamine H4 receptor and γ-secretase were synthesized as suggested by the software. The computational approach proved to be suitable for scaffold-hopping from known ligands to novel chemotypes, and for generating bioactive molecules with drug-like properties

    Repurposing de novo designed entities reveals phosphodiesterase 3B and cathepsin L modulators

    No full text
    Using computational bioactivity prediction models we identified phosphodiesterase 3B (PDE3B) and cathepsin L as macromolecular targets of de novo designed compounds. By disclosing the most potent cathepsin L activator known to date, small molecule repurposing by target panel prediction represents a feasible route towards innovative leads for chemical biology and molecular medicine

    A collection of robust organic synthesis reactions for in silico molecule design

    No full text
    A focused collection of organic synthesis reactions for computer-based molecule construction is presented. It is inspired by real-world chemistry and has been compiled in close collaboration with medicinal chemists to achieve high practical relevance. Virtual molecules assembled from existing starting material connected by these reactions are supposed to have an enhanced chance to be amenable to real chemical synthesis. About 50% of the reactions in the dataset generate new rings in order to enhance assembly of novel ring systems and innovative chemotypes. A comparison with a recent survey of the reactions used in early drug discovery revealed considerable overlaps with the collection presented here. The dataset is available encoded as computer readable SMIRKS expressions from the supporting information

    Probing the bioactivity-relevant chemical space of robust reactions and common molecular building blocks.

    No full text
    In search for new bioactive compounds, there is a trend towards increasingly complex compound libraries aiming to target more demanding targets of the future. In contrast, medicinal chemistry and traditional library design rely mainly on a small set of highly established and robust reactions. Here, we probe a set of 58 such reactions for their ability to sample the chemical space of known bioactive molecules, and the potential to create new scaffolds. Combined with ~26,000 common available building blocks, the reactions retrieve around 10% of a scaffold-diverse set of compounds active on human target proteins over all major pharmaceutical target classes. Enabling new discoveries, almost 80% of newly generated scaffolds from virtual one-step synthesis products are not present in known bioactive molecules for human targets. Our study indicates that established synthesis resources are well suited to cover broadly the known bioactivity relevant chemical space and that there is plenty of unexplored chemical space addressable by these reactions, possibly providing valuable “low-hanging fruits” for hit discovery

    Probing the Bioactivity-Relevant Chemical Space of Robust Reactions and Common Molecular Building Blocks

    No full text
    In the search for new bioactive compounds, there is a trend toward increasingly complex compound libraries aiming to target the demanding targets of the future. In contrast, medicinal chemistry and traditional library design rely mainly on a small set of highly established and robust reactions. Here, we probe a set of 58 such reactions for their ability to sample the chemical space of known bioactive molecules, and the potential to create new scaffolds. Combined with ∼26 000 common available building blocks, the reactions retrieve around 9% of a scaffold-diverse set of compounds active on human target proteins covering all major pharmaceutical target classes. Almost 80% of generated scaffolds from virtual one-step synthesis products are not present in a large set of known bioactive molecules for human targets, indicating potential for new discoveries. The results suggest that established synthesis resources are well suited to cover the known bioactivity-relevant chemical space and that there are plenty of unexplored regions accessible by these reactions, possibly providing valuable “low-hanging fruit” for hit discovery

    Property distributions.

    No full text
    <p>Comparison of property distributions between compounds designed by DOGS (<i>left</i>) and the reference compounds (<i>right</i>). ‘Rule of 5’ violations (<b>A</b>) and log<i>P</i>(o/w) values (<b>C</b>) were calculated using MOE. Dug-likeness scores (<b>B</b>) were computed by a trained neural network classifier (1 = high drug-likeness).</p

    Bioisosteric replacement.

    No full text
    <p>Side-chains addressing the S1 pocket present in the reference compounds (<i>left</i>) and surrogates suggested by DOGS found in top-scored 200 designs (<i>right</i>).</p

    Structural features of H<sub>4</sub>R ligands.

    No full text
    <p>Highlighted features of known H<sub>4</sub>R ligands (compound <b>8 </b><a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002380#pcbi.1002380-Sander1" target="_blank">[40]</a>: central triazole ring; compound <b>9 </b><a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002380#pcbi.1002380-LeeDutra1" target="_blank">[41]</a>: ether alkyl linker) are combined in designed compound <b>7</b>. None of these features is present in the reference ligand underlying the design of <b>7</b>.</p
    corecore